Improving Document Transformation Techniques with Collaborative Learned Term-Based Concepts

Author

  • Stefan Klink
Abstract

Document Transformation techniques have been studied for decades. This paper presents a new approach that significantly improves them by using a new query expansion method. In contrast to other methods, the query is expanded by adding the terms most similar to the concept of each individual query term, rather than terms that are similar to the complete query or directly similar to the query terms. Experiments show that this significantly improves the retrieval effectiveness of Document Transformation techniques, as measured by recall-precision.

Similar resources

Improving Document Retrieval by Automatic Query Expansion Using Collaborative Learning of Term-Based Concepts

Query expansion methods have been studied for a long time, with debatable success in many instances. In this paper, a new approach is presented based on using term concepts learned by other queries. Two important issues with query expansion are addressed: the selection and the weighting of additional search terms. In contrast to other methods, the query is expanded by adding those term...

Document Clustering Based on Ontology and a Fuzzy Approach

Data mining, also known as knowledge discovery in databases, is the process of discovering unknown knowledge from large amounts of data. Text mining applies data mining techniques to extract knowledge from unstructured text. Text clustering, one of the important techniques of text mining, is the unsupervised classification of similar documents into different groups. The most important step...

Collaborative Learning of Term-Based Concepts for Automatic Query Expansion

Information Retrieval Systems have been studied in Computer Science for decades. The traditional ad-hoc task is to find all documents relevant to a given ad-hoc query, but the accuracy of ad-hoc document retrieval systems has plateaued in recent years. At DFKI, we are working on so-called collaborative information retrieval (CIR) systems which unintrusively learn from their users' search proces...

TCL - An Approach for Learning Meanings of Queries in Information Retrieval Systems

The accuracy of ad-hoc document retrieval systems has plateaued in recent years. At DFKI, we are working on so-called collaborative information retrieval (CIR) systems which unintrusively learn from their users' search processes. As a first step towards such techniques, we focus on a restricted setting in CIR in which only old queries and correct answer documents to these queries are available for...

A Multiagent Framework for Collaborative Conceptual Learning Using a Dempster-Shafer Belief System

In this paper, we describe a multiagent framework for collaborative conceptual learning using a Dempster-Shafer belief system in the domain of information retrieval. In our multiagent system, each agent maintains a database of documents, entertains different queries from its users, and thus learns a unique dictionary of concepts. Filed for each concept is a set of keywords collected from the doc...

Publication date: 2004